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ABSTRACT 

We investigate the importance of projection effects in the identification of galaxy clus¬ 
ters in 2D galaxy maps and their effect on the estimation of cluster velocity dispersions. 
From large N-body simulations of a standard cold dark matter universe, we construct 
volume-limited galaxy catalogues that have similar low-order clustering properties to 
those of the observed galaxy distribution. We then select clusters using criteria tai¬ 
lored to match those employed in the construction of real cluster catalogues such as 
Abell’s. We find that our mock Abell cluster catalogues are heavily contaminated 
and incomplete. Over one third (34±6 per cent) of clusters of richness class R>1 are 
miclassifications arising from the projection of one or more sub-clumps onto an intrin¬ 
sically poor cluster. Conversely, 32±5 per cent of intrinsically rich clusters are missed 
altogether from the R>1 catalogues, mostly because of statistical fluctuations in the 
background count. Selection by X-ray luminosity rather than optical richness reduces, 
but does not completely eliminate, these problems. Contamination by unvirialised 
sub-clumps near a cluster leads to an overestimation of the cluster velocity dispersion 
which can be very substantial even if the analysis is restricted only to galaxies close to 
the cluster centre. Thus, the distribution of cluster masses - often used to test cosmo¬ 
logical models - is a highly unreliable statistic. The median value of the distribution, 
however, is considerably more robust because the main effect of contamination is to 
create an artificial tail of high velocity dispersion clusters. Improved estimates of the 
cluster velocity dispersion distribution require constructing new cluster catalogues in 
which clusters are defined according to the number of galaxies within a radius about 
three times smaller than the Abell radius. 

Key -words: galaxies: clusters: general - cosmology: theory 


1 INTRODUCTION 


Clusters of galaxies are a major source of cosmological in- 
)n. Bocauso of thoir largo luminosity they can bo do 


formati on, oucause ui Ijiiuir largu imiiiiiosiiy 
tected, laud their properties can he measured with relative 
ease, out to large distances. This makes it possible to exploit 
their special characteristics as the most massive nonlinear 
objects in the Universe. In hierarchical clustering theories 
for the formation of structure, clusters are associated with 
the rare high peaks in the primordial density field on scales 
of a few megaparsecs. As a result, their mass and abundance 
are very sensitive to the amplitude of mass fluctuations on 
these scales (Frenk et al. 1990; White, Efstathiou & Frenk 
1993; Viana & Liddle 1996; Eke, Cole & Frenk 1996b). The 
epoch of cluster formation and the rate at which the clus¬ 
ter population builds up is, sim ilarly, a strong function of 


the mean density parameter, D (Lacey & Cole 1993 


Eke et 


al. 1996b), as is their degree of internal substructure (Rich- 
stone, Loeb & Turner 1992; Mohr et al. 1995; Wilson, Cole 


& Frenk 1996). The clustering properties of clusters depend 
primarily on the shape of the power spectrum of mass fluc- 
tuatio ns and have been a subject of much debate f or over 20 
years (Bahcall & Soneira 1983; Dalton et al. 1994; Eke et al 


1996a). Finally, rich clusters have recently been used to map 


the local density field (Plionis et al. 1996, in preparation). 


The use of galaxy clusters as cosmological diagnostics 
relies on the availability of statistical samples, selected ac¬ 
cording to a well-defined property, such as richness, mass, or 
X-ray temperature. Traditionally, the source of such samples 
has been Abell’s (1958) cluster catalogue. The integrity of 
the Abell catalogue, however, has been questioned over the 
years (e.g. Fesenko 1979a,b; Lucey 1983; Frenk et al. 1990). 
Even Abell himself made it quite clear that the complete¬ 
ness and homogeneity of his catalogue were suspect. Partly 
to overcome these shortcomings new cluster catalogues were 
constructed in the early 1990s based, as Abell’s, on pho¬ 
tographic material, but replacing eye-ball identifications by 
automated scans of digitized plates. These procedures have 
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produced the APM (Dalton et al. 1992) and Edinburgh- 


Durham cluster catalogues (EDCC; Lumsden et al. 1992). 
Computer manipulation of the galaxy images has allowed 
a degree of uniformity and repeatability to be reached that 
was impossible in Abell’s days. Nevertheless, Abell clusters 
remain the best studied, and Abell’s catalogue the main 
source from which samples are drawn for statistical stud¬ 
ies. 

Whether by eye or by computer, catalogued clusters 
are identified as two-dimensional objects, seen against a 
strongly clustered background. The enormous column depth 
to a cluster makes projection effects inevitable. Indeed, spec¬ 


troscopic follow-up of Abell and AGO clusters (Abell et al 


198S) cften reveals several clumps of galaxies lined up in 


the direction to a rich cluster (see Katgert et al. 1996 for 
a recent study of a large sample). Although these clumps 
enhance the apparent richness of the cluster, the dominant 
concentration along the line-of-sight is often rich enough to 
emit X-rays (e.g. Briel & Henry 1993). This fact alone, how¬ 
ever, says little about the importance of projection effects 
or the completeness of optically selected cluster samples. 

Since the abundance of clusters declines very rapidly 
with richness or mass, even a small amount of contamination 
can compromise statistical studies in which completeness 
and/or homogeneity are required. The cluster two-point cor¬ 
relation function is a good example. Values for the correla- 


strongh 

r advocated bv various workers (Bahcall & Soneira 

198^ 

3stman. Geller & Huchra 1992 

; Efstathiou et al. 19921 

Nichol et al. 1992|; |Dalton et al. 1994). According to Bahcall 


& West (1992) the differences are due to the different limit¬ 
ing richnesses of the various samples but others have claimed 
that they are due (at least in part) to a misclassification of 
poor clusters which are placed into a higher Abell richness 
class as a result of con t amination by the halos of rich clus- 


ters (Sutherland 1988; iDekel et al. 198£|; 

Efstathiou et al. 

1992|)!^ke et al. ( 1996a|) have argued that 

a combination of 


different selection procedures and a richness dependence of 
the clustering strength also contributes to these differences. 

While even distant sub-clumps artificially enhance the 
apparent richness of a cluster, subclustering in its immedi¬ 
ate vicinity causes its velocity dispersion to be overestimated 
(e.g. Frenk et al. 1990). This effect is the likely cause of the 
poor correlation between the X-ray temperat ure and veloc- 
ity dis persion o f the most massive clusters (David, Jones 
& Forn an 199£| ) and it vitiates comparisons between clus¬ 
ter masses de termined from X-ray , optical and gravitational 
lensing data (Fahlman et al. 1994). The distribution of clus¬ 


ter masses or velocity dispersions has been used as a discrim¬ 
inant of different cosmological models (e.g. Weinberg & Cole 
1992; Bahcall & Cen 1993; Lubin et al. 1996). These com¬ 
parisons tend to rely heavily on the behaviour of the high 
mass end of the distribution which, unfortunately, is particu¬ 
larly sensitive to contamination due to substructure. Masses 
derived from X-ray data are more reliable (Evrard, Metzler 
& Navarro 1996; but see Balland & Blanchard 1995), but 
since cluster samples are almost invariably selected by their 
optical properties, the inferred distributions of X-ray prop¬ 
erties are also subject to the kind of uncertainties discussed 
above. Furthermore, by virtue of the fact that optically se¬ 
lected cluster catalogues all have a lower cut-off in richness. 


there is an in-built bias in these samples against low mass 
clusters. 

It seems clear that the use of cluster properties as cos¬ 
mological diagnostics requires a detailed understanding of 
the biases introduced by projection effects and contamina¬ 
tion in cluster catalogues. The aim of this paper is to set up a 
methodology for quantifying such biases using mock galaxy 
catalogues constructed from N-body simulations. In this pa¬ 
per we analyze mock catalogues constructed from standard 
D = 1 cold dark matter (CDM) simulations and we con¬ 
centrate on Abell clusters. Our procedures, however, can 
readily be extended to other cosmologies and other clus¬ 
ter catalogues. We also use our mock catalogues for testing 
alternative procedures for defining clusters and estimating 
their properties which avoid some of the biases present in 
Abell’s catal ogue. Our work extends previous analyses by 
Frenk et al. (1990) and White (1991,1992) who used similar 
techniques. An earlier assessment of the completeness and 
contamination of Abell’s catalogue, u sing M onte-Carlo sim¬ 
ulations, was carried out by Lucey (1983). He found that 
between 15 and 25 per cent of rich Abell clusters have a 
true membership that is less than half the number observed. 
However, his models contained no dynamical information, 
and they did not take into account the clustering properties 
of galaxies and clusters. 

In Section 2 we give technical details of our simula¬ 
tions and our method for constructing galaxy catalogues 
from which mock Abell catalogues are derived. The galaxy 
catalogues encapsulate the essential characteristics of the 
real situation, although we have not tried to reproduce the 
observed properties in every detail. Section 3 presents an 
analysis of the completeness of the cluster catalogues. By 
identifying groups along the line-of-sight to each cluster, we 
determine the properties of the main concentration of galax¬ 
ies, and those of clumps projected onto the cluster. The var¬ 
ious categories of contamination we identify are described in 
Section 4. In Section 5 we discuss how projection effects in¬ 
fluence estimates of cluster velocity dispersions derived from 
radial velocity measurements and illustrate how careful clus¬ 
ter selection and interloper removal can improve upon the 
accuracy of these estimates. In Sectio 6 we apply a popular 
statistical test for substructure to our data and demonstrate 
its potential for flagging clusters whose velocity dispersion 
estimates are strongly affected by substructure. In Section 7 
we present the cluster-cluster correlation function, and il¬ 
lustrate how overlapping clusters can affect the amplitude 
of this function. Finally, we present a summary of our main 
results in Section 8. 


2 METHODS 
2.1 Simulations 

Our analysis employs the set of 8 st andard CDM N-body 
simulations described by Eke et al. (1996a). Each simula¬ 
tion followed 128® particles in a comov ing p eriodic box 256 
h“®Mpcp] on a side, using Couchman’s (1991) adaptive P®M 

'M©. 


code. The mass per particle is therefore 2.2 x 10®®/i ® 


* We write Hubble’s constant as Hq = lOOhkms ^Mpc ^ 
throughout. 
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Figure 1. The real space two-point correlation function of galax¬ 
ies in our catalogues compared with observations. The solid circles 
show ^{r) obtained by averaging over 8 independent simulated 
galaxy catalogues. The open circles show Baugh’s (1996) depro¬ 
jection of the angular correlation function of the APM galaxy 
catalogue. The triangles show the real-space correlation function 
derived by Loveday et al. (1995) from the Stromlo-APM galaxy 
redshift survey. 


The force softening (for an equivalent Plummer potential) 
was fixed at ~ 65/i~^kpc in comoving coordinates. 

Initial conditions were laid dow n usin g the CDM trans¬ 
fer function given by Bardeen et al. (1986) (hereafter BBKS) 
for adiabatic fluctuations in a universe with a negligibly 
small baryon density and h = 0.5. Initial velocities and dis¬ 
placements were computed using the ZePd ovich approxima- 
t ion a s outlined by Efstathiou et al. (1985) and Davis et al. 
(1985). The expansion factor a was chosen so that as = a, 
where as is the rms amplitude of mass fluctuations in a 
sphere of radius 8 /i“^Mpc. Each simulation was started at 
a=0.05 and halted at a=0.63, using a time-step Aa=0.002. 
Only the final epoch a=0.63 was used for the present study. 
According to White et al. (1993), Viana & Liddle (1996) and 
Eke et al. (1996b) this normalization is required to obtain 
approximately the observed abundance of rich clusters. 


2.2 Constructing Galaxy Catalogues 

It is not yet possible to carry out simulations of cosmolog¬ 
ical volumes that are large enough to contain many galaxy 
clusters and have enough resolution to model the dissipa¬ 
tive processes of star and galaxy formation. Attempts to 
introduce galaxies into numerical simulations are therefore 
subject to considerable uncertainty. We have implemented 
a scheme that produces galaxy catalogues that match the 
observed galaxy-galaxy two-point correlation function over 


the range of separations of interest and on which we can 
apply a procedure for finding clusters that closely mimics 
Abell’s selection criteria. The prescription we use is based 
on the peak-background split technique outlined in BBKS. 
Further information abou t the detailed implementation may 
be found in White et al. (1987). 

The BBKS formalism gives the number density of peaks 
of a certain height in a field F filtered on a galaxy scale, Vs, 
even though the simulations do not resolve the field, F, on 
that scale, but only on a scale rj, > rs. The scale Vs we asso¬ 
ciate with individual galaxies and corresponds to ~ 1 O^^M 0 . 
Now let Fa denote the Gaussian random density field, F, 
smoothed with a Gaussian filter of width rs. We associate 
a galaxy with a density peak in excess of Vaas, where a a 
is the rms value of the field Fa. Similarly, we define the 
quantities E), and ai, for the same field, F, now filtered on a 
scale rf,. The full expression for the quantity J\fpk{i'a\i'b)di'a, 
the number density of peaks in the field Fa with height in 
the range Uaaa to (ua + di/a)aa, at points where the back¬ 
ground field has the value i^bab, is given in Appendix E of 
BBKS. In order to ensure that the correlation properties of 
the galaxies reflect those of the underlying distribution of 
peaks in Fa we choose a suitable filter function to obtain Fb- 
By using a sharp low-pass filter in fe-space, we eliminate the 
correlations between the difference field Fa — Fi, and E),. The 
price of this is that oscillations in the correlation function of 
the background field only vanish asymptotically. However, 
this effect is negligible on the scales of interest and, since 
we are not seeking to provide an exact match to the galaxy 
distribution, this scheme is quite sufficient for our purposes. 

The procedure outlined above was implemented as fol¬ 
lows: the initial density distribution (sampled on a 128® grid) 
was smoothed by removing all power below r(,=8.75/i“®Mpc. 
A tabulated version of jg'pk{i'a\i'b)di'a was then used to find 
the ’peak number’ associated with each point on the grid 
of the smoothed initial density field. In order to construct a 
volume-limited catalogue, we determined the total number 
of galaxies by requiring that the luminosity density of our 
model catalogue be consistent with recent determinations of 
the luminosity function (e.g. Loveday et al. 1992; Marzke et 
al. 1994). Our adopted value of pl = 0.0176E*E®Mpc“® lies 
in the range bracketed by the observational data which differ 
by rather large amounts. This normalization reproduces the 
observed abundance of Abell clusters (~ 8 x 10“®/i®Mpc“®, 
Scaramella et al. 1991) when we identify clusters using the 
method described in the next subsection. The only other two 
parameters Ua = 1.2 and Va = 0.54/i“^Mpc were chosen to 
match the observed galaxy-galaxy correlation function, ^gg 
(see Figure 1). For our chosen values of Ua and ri,, the mean 
luminosity associated with each peak is 0.87 E*, and the 
mean luminosity associated with each particle is 0.14 E*. 
(With the large number of particles in these simulations, no 
oversampling is necessary, even in the densest regions; see 
White et al. 1987). Where necessary, we have assumed that 
the distribution of galaxy luminosities follows a Schechter 
function: 


(t>{L)dL = iV.E"“ exp(-E) /r(2 - a)dL, (1) 

where E is expressed in units of E*, and E is the gamma 
function. The final galaxy catalogues are constructed by ran¬ 
domly selecting particles in the simulation volume and iden¬ 
tifying them as galaxies with a probability directly propor- 
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Abell richness of 2D cluster Abell richness of 2D cluster 


Figure 2. The total luminosity of the main cluster (solid 
squares), together with the second (crosses) and third (open cir¬ 
cles) brightest clumps along the line-of-sight to each cluster, as 
a function of the galaxy count in 2D. The dashed line represents 
the 3D luminosity threshold of 43Z/* required for an Abell cluster 
of richness class R=l. No background count has been subtracted 
from the 2D cluster richness count. 


Figure 3. The X-ray luminosity (within a detect cell of radius 
0.75h“^Mpc) of the main cluster (solid squares), together with 
the second (crosses) and third (open circles) brightest clumps 
along the line-of-sight to each cluster, as a function of the galaxy 
count in 2D. The dashed line corresponds to a luminosity of 
1.8x lO'^^erg/s, the lowest detection in the survey of Briel & 
Henry (1993) 


tional to the ‘peak number’ associated with the grid point 
nearest that particle in the initial conditions. The galaxy 
inherits from the particle both its position and velocity at 
later times. 


2.3 Cluster Selection 


Abell defined a rich clnster as an enhancement of galax¬ 
ies on the Palomar Sky Survey plates. To qualify as a 
cluster, the number of galaxies within an ‘Abell radius’ 
{va = 1.5/i“^Mpc) from the proposed cluster centre had to 
exceed a certain number, Ua, after background subtraction. 
Galaxies were counted in the magnitude interval between m 3 
and m 3 + 2 , where m 3 is the apparent magnitude of the third 
brightest galaxy. For a cluster of Abell richness class R=l, 
the lowest richness class that can be considered reasonably 
complete, Ua > 50, while for a cluster of Abell richness class 
2 , ria > 80. 

As starting point for the construction of mock cata - 
logues that encapsulate the main features of Abell’s (195f) 


cluster catalogue, we use volume-limited galaxy samples. We 
assume that the luminosity distribution of the cluster galax¬ 
ies is drawn from a Schechter function as given in equation 
(1). We construct catalogues that have the correct number 
of galaxies to cover the interval down to two magnitudes 
below that of the median value of the third most luminous 
galaxy drawn from a cluster with a true luminosity equal to 
that expected for a cluster with R=l. Since the box length 


in our simulations is smaller than the effective path length 
to a typical Abell cluster, this limit is equivalent to Abell’s 
counting limit. The number of galaxies projected onto a typ¬ 
ical model cluster, however, is smaller than the number of 
galaxies along the-line-of sight to a moderately distant Abell 
cluster. Thus our procedure will underestimate the degree 
of contamination by distant background galaxies. 

Each volume-limited catalogue was projected along the 
coordinate axes onto three orthogonal planes, thus produc¬ 
ing three different 2D galaxy catalogues from each simu¬ 
lation. A friends-of-friends group finding algorithm (Davis 
et al. 1985) with a linking length 30 per cent of the mean 
inter-galaxy separation was then applied to the 2D galaxy 
catalogues. The resulting list of groups is the starting point 
for the cluster search. (The final cluster catalogues are insen¬ 
sitive to the choice of linking length in the range ~ 0.3 —0.6.) 
The total luminosity projected within the Abell radius, Va, 
is Nt,Lt. The normalization is thus fixed by the quantity Nt 
which we now calculate. 

The cumulative distribution of luminosities is found by 
integrating the luminosity function 

/ OO 

(t>{L')dL' = r{l-a, L)N^ ( 2 ) 

where T{a,L) is the incomplete gamma function. The dis¬ 
tribution of the jth brightest cluster member is given by 

(j}j{L)dL = exp [-'ip{L)]<j){L)dL, (3) 
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( Bchechter 1976| ). For the third brightest galaxy this gives, 
(f)3{L)dL = exp [-'ip{L)\4>{L)dL (4) 

and consequently 

/ OO 

MLjdP 

= 1-(l+ip{L) + ^'ilj^{L)'jexp[-ip{L)]. (5) 

The median luminosity of the third brightest galaxy, L3, is 
found by solving V'3(i) = 51 which gives '(/’(^s) = 2.674 and 
therefore L3 is the solution to: 

00 

^(Ls) = j 4>{L')dL', ( 6 ) 

L3 

which yields L3=2.11. Bearing in mind that Abell’s richness 
count is defined over a 2 magnitude interval, we get the 
following relation between the normalization parameter, N ,, 
and the galaxy count, Ua'- 

^ r(2^ q) / L~°‘exp{-L)dL. (7) 

0 . 1585^3 

For R=1 clusters, A* = 60 for a = 1. N^: depends only 
weakly on the value of a (it varies by less than 10 per cent 
over the range a=l.0—1.5). A* is therefore not very sensitive 
to the highly correlated and poorly constrained luminosity 
function parameters. 

The 50 galaxies between m3 and m3+ 2 represent a total 
luminosity within of 60L*. Thus, to ensure that all R>1 
clusters in the simulation volume, V, are detected requires 
a volume-limited catalogue with 

50 

( 8 ) 

galaxies. Since the total number of galaxies that resides in 
the clusters is small compared with the total number in 
the box, we assume that the contamination by foreground 
and background galaxies is proportional to the volume pro¬ 
jected onto the cluster, i.e. to the number of galaxies within 
a cylinder of volume Tvralbox {hox = centred on the 

cluster. Within an Abell radius we expect, on average, 27 
background galaxies in addition to at least 50 cluster mem¬ 
bers. 

Each of the groups identified using the friends-of-friends 
algorithm in the 2D galaxy catalogue was checked to see 
whether the number of galaxies within the Abell radius ex¬ 
ceeded the 76 needed to qualify as an R=1 cluster. The 
poorer of a pair of overlapping clusters (projected separa¬ 
tion < 2ra) was removed. On average the number of clus¬ 
ters found per catalogue was 139, corresponding to a number 
density ~ 8 x 10“®/i®Mpc“®. 

One of the quantities that we wish to calculate is the 
fraction of clusters that would actually meet Abell’s crite¬ 
rion if this were applied in three dimensions. For this we 
need to convert the 2D luminosity threshold found above 
(A^* = 601/*) to a 3D threshold. This requires making a 
correction for the galaxies that are projected onto the clus¬ 
ter, but lie outside r^. We assume that an average spherical 


cluster can be adequately described by a power-law density 
profile p(r) oc r~^. The correction is the ratio of Mppa), 
the mass seen in projection within a cylinder of radius Va, 
to the total mass Mpa) within the aperture in 3D. For the 
power-law density profile these masses are. 


Mp{r) 


4^ ,3-. r(|)r(^) 
3-7 2r(2) 


( 9 ) 


M{r) 


An 

3-7 


^3-7 


( 10 ) 


Takin g 7 = 2.2, the value derived by Lilje & Efstathiou 
( 1988| ) from their determination of the cluster-galaxy cross¬ 
correlation function, we find a ratio M{ra)/Mp{ra)=0.72 
and therefore a 3D threshold for richness class R>1 of 
43Lt. The assumed value of 7 is in good agreement with 
7 = —2.3 ± 0.4 found directly from the luminosity weighted 
particle distribution over the interval between 0.1 and 1.5 
/i“^Mpc. The mean slope found for the 2D cluster cata¬ 
logue is —2.0±0.3, slightly lower as a result of contamination 
caused by projection effects. 


3 THE COMPLETENESS OF CLUSTER 

CATALOGUES 

The effects of contamination by foreground and background 
galaxies on the completen ess of the Abell catalogue were 
first considered by Lucey (1983). His Monte-Carlo simula¬ 
tions, however, were crude because they did not take into ac¬ 
count the clustering of the contaminating galaxies. By con¬ 
trast, our mock galaxy catalogues reproduce the low-order 
clustering statistics of the real galaxy distribution over the 
range of scales of interest and so they provide a much better 
approximation to the source of projection effects. 

To investigate the reality of clusters identified in pro¬ 
jection, we compare our mock 2D cluster catalogues with 
catalogues of clusters identified in 3D. The latter were con¬ 
structed from the centres returned by the friends-of-friends 
group finder applied to the dark matter distribution, using a 
linking length of 10 per cent of the mean inter-particle sepa¬ 
ration. (Again, the results are insensitive to this choice.) The 
luminosity of these 3D clumps was calculated by summing 
over the total luminosity associated with the particles con¬ 
tained within a sphere with radius 1.5/i“^Mpc. Only groups 
with more than eight particles were considered. With each 
2D ‘Abell’ cluster, we associate all 3D clusters whose centres 
fall within the Abell radius of the projected cluster. Figure 2 
shows the distribution of luminosities of the three most lu¬ 
minous 3D groups identified along the line-of-sight towards 
each Abell richness class R>1 cluster in one of our 24 cata¬ 
logues. The 3D threshold of 43L* required for richness class 
R=1 is shown by the dashed line. Averaged over all galaxy 
catalogues, we find that 34 ±6 per cent of ‘Abell’ clusters are 
not associated with a 3D cluster which, on its own, meets 
the generalized Abell criterion. These clusters are only seen 
above the threshold in projection because of the superposi¬ 
tion of several sub-threshold clumps along the line-of-sight. 
Conversely, we find that 32±5 per cent of the 3D clumps 
brighter than 431/* fail to be picked out as Abell clusters 
in projection. In most cases, such non-detections are due to 
fluctuations in the background count as a result of which in¬ 
trinsically rich clusters are misclassified as poorer groups. In 
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Figure 4. The top panel shows the distribution of galaxies in two R=1 clusters (numbers 40 and 120, left two columns) and two R=2 
clusters (numbers 68 and 86). Different sections along the line-of-sight have been plotted with different symbols, as indicated in the 
legend at the bottom of each figure. (The numbers in the legend are intervals in units of Mpc/h.) Open symbols are used for galaxies in 
front and solid symbols for galaxies behind the cluster, while crosses denote galaxies that belong to the cluster. The middle row shows the 
corresponding distribution of radial velocities in a cylinder centred on the cluster. The shaded part of the histogram shows the galaxies 
that are classed as cluster members after a 3cr clipping procedure (see below); the unshaded regions have been classed as foreground and 
background objects. The lower panel shows true positions along the line-of-sight. Note that these lower two panels are centred on the 
cluster. The dashed lines denote an 8000 km/s window, centred on the cluster, used to make an initial selection for the determination of 
velocity dispersions. These clusters are only weakly contaminated by projection effects. 


addition, in 5 per cent of cases, we find that two 3D clumps 
above the threshold are associated with a single 2D object. 
Thus, in our simulations, about a third of clusters classified 
as Abell R>1 clusters are, in fact, poorer groups, whilst a 
similar fraction of intrinsically rich clusters, are not included 
in the catalogue at all. We conclude that catalogues selected 
using Abell’s criteria are neither homogeneous nor complete 
to a uniform integrated luminosity limit. 

X-ray emission from the hot intra-cluster medium pro¬ 
vides an alternative means of selecting cluster samples. No 
extensive sample selected exclusively on the basis of X-ray 
data exists to date. A first step in this direction is the cata¬ 
logue of clusters compiled by Romer et al. (1994) who cor¬ 
related a sample of X-ray sources with optical galaxy counts 
from digitized photographic plates. In most studies however, 
the cluster catalogue is taken as the starting point for fur¬ 
ther selection according to X-ray flux or luminosity (e.g. 


Nichol, Briel & Henry 1994). Any incompleteness present in 
the cluster catalogue is then automatically carried forward 
to the X-ray sample. Nevertheless, it is of interest to ask 
how such X-ray cluster samples are further affected by pro¬ 
jection effects. Our simulations do not follow the gas compo¬ 
nent responsible for the X-ray emission from clusters. How¬ 
ever, we can use the results of recent N-body/gasdynamics 
simulations of the formation of individual clusters to cal¬ 
culate, approximately, the expected X-ray luminosity from 
our clusters. These simulations show that the collapse and 
shock heating of a non-radiative gas during cluster forma¬ 
tion establishes a near equilibrium configuration in which 
the density profile of the gas closely follows that of the dark 
matter (Evrard 1990; Thomas & Couchman 1992; Navarro, 
Frenk & White 1995, but see Anninos & Norman 1996). We 
can therefore estimate the expected X-ray emission from a 
cluster by associating with each particle in our simulations 
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an ‘X-ray luminosity’ proportional to the product of the lo¬ 
cal density and velocity dispersion, Lx oc /9cr„. Both these 
quantities are calculated by averaging over the 10 nearest 
neighbours of each particle. When summed over a group of 
particles, the total ‘X-ray luminosity’ has the same depen¬ 
dence on temperature and density as Bremsstrahlung emis¬ 
sion, Lx oc 

Figure 3 shows the ‘X-ray luminosities’ of the same clus¬ 
ters plotted in Figure 2. The model luminosities were nor¬ 
malized by setting the median luminosity of the brightest 
sub-clump along each line-of-sight equal to 10“^"^erg/s, close 
to the median X-ray luminosity in the 0.5-2.5 keV band of 
a sample of 145 hi gh ga lactic latitude Abell clusters studied 
by Briel & Henry ( [l993|) . These luminosities were calculated 
in an inner sphere of radius 0.75/i“^Mpc, similar, in projec¬ 
tion, to the optimal detect area recommended by Briel & 
Henry for their sample. The dashed line in the Figure corre¬ 
sponds to a luminosity of 1.8x 10"*®erg/s, the lowest 0.5-2.5 
keV rest-frame X-ray luminosity detected by Briel & Henry. 
These authors detected only 46 per cent of the 145 clusters 
in their sample. In part this is due to uneven sky coverage 
which also leads to upper limits on non-detections that vary 
between 1.6x10'*^ and 2.7x 10’‘'*erg/s. These variable lim¬ 
its preclude any conclusions regarding the completeness of 
the X-ray samples. Nevertheless, Figure 3 shows that even 
X-ray selection is not immune from projection effects. A sig¬ 
nificant fraction (16 per cent) of the second-ranked clumps 
and even some of the third-ranked clumps along the line-of- 
sight have X-ray luminosities above the minimum detected 
in the data. In a few cases, the second-ranked clumps have 
comparable luminosities to the first-ranked clumps. Only 
the brightest clusters, those with Lx^W'^'^ erg/s, provide a 
clean, although not necessarily complete, sample. We there¬ 
fore conclude that even X-ray selected samples are, to some 
extent, contaminated by projection effects, although this 
seems to be a weaker effect than for optically-selected sam¬ 
ples. Projection effects are reduced because of the high cen¬ 
tral concentration of the X-ray emission and the use of a 
relatively small detect area. Examination of an analogous 
plot to that in Figure 3, using X-ray luminosities within 
1.5h“'^Mpc rather than within 0.75 /i~^Mpc, shows that 
projection effects are much stronger for the larger detect 
areas. 


4 STRUCTURE IN THE REDSHIFT SPACE 
DIRECTION 

The ‘optically selected’ clusters found using the techniques 
outlined above show considerable variation in structure and 
richness. A significant fraction, ~ 30-40 per cent, of the R>1 
clusters are almost uncontaminated and clearly meet the cri¬ 
terion derived above for an Abell cluster in 3D. Examples of 
such clusters are shown in Figure 4. The lower panel shows 
a histogram of the true spatial distribution of galaxies along 
the line-of-sight, in bins of width 2.56/i“^Mpc (The vertical 
axis has been cut off at 26 galaxies.) These distributions are 
dominated by a single concentration which is much richer 
than any of the sub-clumps along the line-of-sight. The mid¬ 
dle row shows the distribution of radial velocities of all galax¬ 
ies in the direction to the cluster. The histograms are not 
always Gaussian or even symmetric (Note the high velocity 


tail at 02 = 20 ,000km s”*^ in cluster 68), even though the clus¬ 
ter galaxies are well localized in space. (All cluster members 
in the lower panel of cluster 68 fall in a single 2.56h“^Mpc 
bin.) The top row shows the distribution of galaxies on the 
sky. Different symbols indicate background, foreground and 
cluster members (on the basis of the positional information 
in the lower row). All clusters appear regular and, in many 
cases, quite spherical. Although this is typical of practically 
all uncontaminated clusters, there are a few examples of 
elongated objects (e.g. cluster 86). 

The remaining 60-70 per cent of clusters suffer from 
some kind of projection effect. Figure 5 shows four exam¬ 
ples of clusters that have been contaminated by other struc¬ 
tures. In these cases, had contamination been removed, the 
richest remaining group along each line-of-sight would still 
have qualified as an R=1 cluster. Although the velocity his¬ 
tograms do not differ greatly from those of the uncontami¬ 
nated clusters in Figure 4, closer inspection reveals that, in 
some instances, the foreground and background groups ap¬ 
pear to form identifiable structure when viewed face-on (e.g. 
the open triangles in cluster 62); in other cases the contami¬ 
nating galaxies are distributed randomly over the entire field 
(e.g. solid squares in cluster 1). Cluster 24 is an example of 
a binary cluster. The smaller component, containing about 
30 galaxies (open squares), is located 5 /i“^Mpc in front 
of the main cluster which contains ~55 galaxies. Yet, the 
radial velocity distribution in the middle panel is quite sym¬ 
metric and shows no clear evidence of bimodality. Although 
these projection effects are not as severe as those that lead 
to poor groups being detected and classed as rich clusters, 
they affect the richness assigned to a cluster and compro¬ 
mise statistical studies including, for example, estimates of 
the cluster abundance as a function of richness. 

More extreme cases of misidentification and misclassi- 
fication are shown in Figure 6. The main peak on which 
the histograms are centred was found by filtering with a 
4000 km s“^ top hat function. The main galaxy concentra¬ 
tion fails to meet the 43L* luminosity criterion for an R=1 
Abell cluster in 3D, although the clusters do meet the 2D 
criterion of more than 50-1-27 galaxies in projection. A sig¬ 
nificant fraction of all clusters that fall into this category 
are not as centrally concentrated as the clusters in Figures 4 
and 5. However a surprisingly large number looks little dif¬ 
ferent, considering that they are pure superpositions. We 
find a great variety in the nature of these superpositions. 
Some consist of multiple groups of roughly comparable rich¬ 
ness (e.g. clusters 2 and 54), others of two main clumps of 
comparable size (e.g. cluster 10). 

Closer inspection of the histograms of a random subset 
of all our R> 1 clusters reveals that in roughly 14 per cent of 
cases the main peak along each line of sight fails to contain 
even the 30 galaxies required for an R=0 classification. In 36 
per cent of clusters the count lies between 30 and 50 (R=0), 
in 40 per cent between 50 and 80 (R=l) and in 10 per cent 
above 80 (R>2). An average R=1 cluster contains 66 per 
cent of the galaxies that, when projected, form the cluster 
in 2D. This is in excellent agreement with our background 
estimate. 

Although it is often assumed that the radial velocities 
of virialized clusters follow a Gaussian distribution, even a 
small subcluster can cause a detectable asymmetry. We have 
examined the degree to which the radial velocity distribu- 
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Figure 5. As Figure 4, but for mildly contaminated clusters. The main galaxy concentration in these clusters still has richness R>1 after 
contamination has been removed. Such clusters clearly belong in the Abell catalogue, but their richness has been artificially enhanced 
by projection effects. 


tion is Gaussian using a variety of indicators. In addition 
to calculating the skewness and kurtosis of the radial ve¬ 
locity distribution, we have also applied a Lilliefors test (a 
Kolmogorov-Smirnov test that takes into account the fact 
that both the mean and dispersion are estimated from the 
dataset itself). In all cases we estimate confidence intervals 
by comparing the distribution of normalized ‘observed’ ve¬ 
locities, Zi = {vi — v)/a, with random samples drawn from a 
normal distribution. All three tests find that, at the 95 per 
cent confidence level, between 45 and 55 per cent of clus¬ 
ters identified in 2D have velocity distributions which are 
inconsistent with being Gaussian. The percentage of clus¬ 
ters rejected at 90 per cent and 99 per cent confidence levels 
are ~ 55 per cent and ~ 35 per cent respectively. 

Another illustration of the effects of sub-clumping is 
given in Figure 7. Here we plot the peculiar velocities, 
Vpec = Vr — vh, where vh is the Hubble velocity, against 
position {vh / H) for galaxies in the twelve clusters shown in 
Figures 4-6. (In each case we have limited the total range 
along the line-of-sight to 100fe“^Mpc for clarity on smaller 
scales.) Even those clusters which we earlier regarded as 
essentially ‘clean’ (top row) suffer from small amounts of 
contamination. The fraction of clusters that does not have a 


group of at least 5 members projected onto its face is below 
2 per cent. In a number of cases the mean of the peculiar 
velocity distribution is noticeably displaced from zero, as a 
result of motions induced by distant matter or by the prox¬ 
imity of a massive neighbouring cluster. This is clearly seen 
in the case of the ‘binary’ cluster (number 24) and in the 
case of the multiple cluster (number 54). Several examples 
show infalling sub-clumps. Just in front of the main mass 
concentration in cluster 86, a group of 15 galaxies is falling 
into the cluster, with velocities that increase towards the 
cluster core. A similar situation occurs in cluster 1, but now 
there are at least two sub-clumps falling in from different 
directions. In cases where the cluster fails to meet the crite¬ 
rion for inclusion in the 3D ‘Abell catalogue’, the number of 
different small subclusters or groups that make up the 2D 
cluster can be substantial (e.g. cluster 54). 


5 CLUSTER VELOCITY DISPERSIONS 

The velocity dispersions of galaxy clusters, or the cluster 
masses derived from them, are an important cosmological 
diagnostic. They have been used, for example, to test spe¬ 
cific cosmological models (Weinberg & Cole 1992; Bahcall 
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Figure 6. As Figure 4, but for severely contaminated clusters. The main galaxy concentration in these clusters has R<1 after projection 
effects are removed. These clusters should not have been included in the Abell catalogue. 


& Cen 1993; Lubin et al. 1996), and to estimat e the am¬ 
plitude of mass fluctuations on cluster scales ( Evrard & 
Henry .991; White et al. 199^ ; Eke et al. 1996b). The dis¬ 
tribution of velocity dispersion is often used in cumulative 
form, n(> (Jv,ios), the number density of clusters with line- 
of-sight velocity dispersion greater than a^^ioa- A closely re¬ 
lated quantity is the cumulative X-ray temperature function, 
n(> Tx), derived from X-ray observations in the central 
parts of clusters. 


Frenk et al. (199C) calculated n(> av,ios) for an en- 
s embl e of = 1 CDM simulations and Weinberg & Cole 
(1992) calculated this quantity for various Gaussian and 


non-Gaussian models with 17=1 and 17=0.2. On the obser¬ 


vational side, Mazure et al. (1996) have recently estimated 


n(> a- ujos) for a sampl e of 80 clusters from the AGO cata¬ 
logue (Abell et^^ah^JRS^^ . They determined each cj„ i„., using 


at least 10 (and in 48 out of 80 clusters more than 30) radial 
velocities per cluster. These data represent a considerable 
improvement upon previous compilations (e.g. Zabludoff et 
al. 1993; Bahcall & Cen 1993). 

The main concern when estimating cluster velocity dis¬ 
persions is the effect of contamination by unvirialized sub¬ 
clumps whose coherent motion remains hidden by the am¬ 
plitude of the peculiar velocities in the cluster region. As 
Figures 4-6 show, it is not uncommon for a superposition of 
clumps, separated by 10 Mpc or more, to produce an appar¬ 


ently symmetric and featureless radial velocity distribution. 
Using our mock catalogues, we now examine the accuracy 
of various conventional methods for estimating velocity dis¬ 
persions in ‘observational’ (from now on referred to as 2D) 
cluster samples by comparing results obtained from them 
with the ‘true’ velocity dispersions computed directly using 
the full phase-space information for the dark matter parti¬ 
cles within a spherical aperture. 

To estimate the velocity dispersion of our 2D samples, 
we hrst convolved the ‘observed’ velocity histogram with a 
4000 kms“^ top hat filter in order to reject obvious interlop¬ 
ers and obtain an initial estimate of the mean radial veloc¬ 
ity of the cluster. All galaxies with relative velocities greater 
than 4000 km s“^ from the peak of the convolved histogram 
were then excluded from the sample. Next , we applied a 
stand ard optimistic On-clipping procedure (Yahil & Vidal 


1977) which consists of the following steps: (i) estimate the 


mean radial velocity v and velocity dispersion n; (ii) delete 
all galaxies with radial velocity greater than Scr away from 77; 
(iii) estimate v and a for the culled sample; and (iv) remove 
the most extreme galaxy if its radial velocity is greater than 
3a from v. Steps (iii) and (iv) are repeated until the number 
of galaxies stabilizes. This procedure returns a robust and 
stable estimate of the cluster velocity dispersion, 

In panel (a) of Figure 8 we plot the values of ob¬ 
tained by applying this method to the clusters in one of our 
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Figure 7. The radial component of the peculiar velocity plotted against the line-of-sight position of galaxies in clusters. The top row 
shows the R=1 and R=2 clusters of Figure 4, the middle row shows the mildly contaminated clusters of Figure 5, and the bottom row 
shows the severely contaminated clusters of Figure 6. 


mock Abell catalogues against the true values, ob¬ 

tained using all the dark matter particles within the Abell 
radius, Va, of each cluster. (The latter are l/\/3 times the 
full 3D velocity dispersion of the dark matter particles.) 
All qualifying galaxies projected within Va were used in 
the estimate of Gv, 2 d- There is a very poor correlation be¬ 
tween these estimates and the true dispersions. In partic¬ 
ular, the 2D distribution has a tail of high dispersion clus¬ 
ters which is completely spurious. The true distribution con¬ 
tains only 1 cluster with > 1250 kms“^ and none 

with (Jv, 3 D > 1500kms“^. Yet the ‘observational’ sample 
has 33/153 clusters with > 1250kms“iand 20/153 

with a^^ 2 D > 1500 kms“^. These artificially large disper¬ 
sions are caused by unvirialized clumps of galaxies projected 
onto the main cluster. A good example is cluster 40, shown 
in Figures 4 and 7. The derived dispersion for this clus¬ 
ter is crv, 2 D ~ 1400 kms“^, much larger than the true value 
o'v, 3 D = 1020 kms“^. The main culprit is a group of 8 galax¬ 


ies located 35h“^Mpc in front of the cluster which is not 
eliminated by the optimistic 3cr clipping procedure. Accord¬ 
ing to a Lilliefors test, the hypothesis that the radial velocity 
distribution of this cluster is Gaussian cannot be rejected 
with more than 25 per cent confidence. 

In practice, velocity dispersions for real clusters are 
most commonly estimated using only central galaxies, rather 
than galaxies spread out over the entire Abell circle, as we 
have assumed in panel (a) of Figure 8. In panel (b) of this 
figure we show the result of estimating cr„,2D from galax¬ 
ies projected only onto the inner 0.5h“^Mpc of the clus¬ 
ter. By sampling a smaller area, the number of contaminat¬ 
ing clumps is reduced and, as a result, the spurious tail of 
high cr„,2D clusters is diminished but not altogether elim¬ 
inated. The correlation between cr„,2D and (t„, 3 d remains 
rather poor. As before, dispersions for a substantial fraction 
of the population are overestimated and, in several cases, 
they are signihcantly underestimated. 


© 0000 RAS, MNRAS 000, 000-000 












Projection effects in cluster catalogues 11 



Figure 8. Comparison of ‘true’ velocity dispersions {(Jv, 3 d) and various ‘observational’ estimates (iTu, 2 d)- In all cases, the ‘true’ values 
are l/VS times the 3D velocity dispersion of the dark matter particles within a sphere of the appropriate radius, (a) Clusters identified 
using Abell’s criterion and (T„_ 2 D estimated using an optimistic 3cr clipping procedure applied to all galaxies projected within the Abell 
radius, (b) Clusters identified using Abell’s criterion and 2 D estimated using an optimistic 3 (t clipping procedure applied only to 
galaxies projected within a circle of 0.5/i“^Mpc radius, (c) Clusters identified using a scaled version of Abell’s criterion to a radius of 
0.5/i“^Mpc and (t„ 2 D estimated using an optimistic So- clipping procedure on the galaxies projected within this radius, (d) Clusters 
identified using Abell’s criterion and (T„_ 2 D estimated using the interloper removal algorithm of den Hartog &; Katgert (1996). 


The main reason why restricting the radial velocity sam¬ 
ple to the inner regions of the cluster does not produce a 


Abell’s catalogue by a different one constructed using a 
smaller sea rch radius. The APM cluster catalogue (Dalton 


more satisfactory result is that the criteria used to select 
clusters in the first place already produces a heavily con¬ 
taminated catalogue. This is due to the large search radius 
employed by Abell. To illustrate this point we construct a 
new set of cluster catalogues in which the search radius has 
been reduced by a factor of three, to 0.5 /i~^Mpc. Assuming 
a mean radial density profile, p(r) oc the number of 

galaxies required for richness class R>1 within this reduced 
radius is 24. This scaled Abell criterion produces different 
cluster catalogues than Abell’s standard criterion. We now 
apply the same optimistic So- clipping procedure as before 
(using galaxies within 0.5/i“^Mpc). The resulting values of 
o'v,2D are compared with the true values, cr„,3D (now calcu¬ 
lated using dark matter particles within a sphere of radius 
0.5 /i~^Mpc) in panel (c) of Figure 8. The result is consider¬ 
ably better. Virtually all the spurious large dispersions are 
eliminated and the correlation between the 2D estimates and 
the true values is considerably tighter. 

The dramatic improvement shown in panel (c) can¬ 
not, of course, be achieved in practice without replacing 


et al. 1992) fulfills this criterion although the search radius 
of 0.75 /i“^Mpc is somewhat larger than our recommended 
value of 0.5/i“^Mpc because the latter did not yield a suf¬ 
ficient number of galaxies in the APM data. However, a 
weighting scheme was applied which reduces the weight of 
galaxies in the outer 0.25 /i“^Mpc ring. Our mock catalogues 
indicate that the velocity dispersion distribution of APM 
clusters should be considerably more reliable than that of 
Abell clusters. Unfortunately an extensive redshift survey 
of APM clusters has yet to be undertaken. 

Finally, we have tested the inte rlope r removal method 
proposed by den Hartog & Katgert (1996). This method at¬ 


tempts to find an acceptable range for the radial velocities to 
be included in the galaxy sample depending upon the pro¬ 
jected distance from the cluster centre. Using an iterative 
scheme, the method rejects galaxies with radial velocities 
differing by more than Vmax{R) from the systemic velocity, 
where Vm,ax{R) is the maximum of the line-of-sight compo¬ 
nent of two velocities: (a) the infall velocity for all positions 
along the line-of-sight at a given projected separation, R, 
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from the cluster centre, and (b) the circular velocity. In order 
to calculate the infall velocit y a va lue of must be assumed, 
but den Hartog & Katgert ( 199E| ) found that their results 
do not depend signihcantly on the choice of fl. This method 
succeeds in eliminating more galaxies than the optimistic 
3a clipping routine. The comparison between the velocity 
dispersions obtained using this method and is shown 

in panel (d) of Figure 8. The ‘observational’ estimates cor¬ 
relate better with the true values than the estimates used 
in Figures 8(a) and 8(b), but not as well as those in Fig¬ 
ure 8(c). The den Hartog-Katgert method tends to underes¬ 
timate most velocity dispersions by ~ 100 km s“^while, at 
the same time, failing to completely remove the spurious tail 
of large dispersions. 

The cumulative distributions of velocity dispersion re¬ 
turned by the various methods discussed above are com¬ 
pared with the true distribution in Figure 9. The former were 
constructed by combining data from all 24 mock cluster cat¬ 
alogues. The true distribution, calculated from the velocity 
dispersion of the dark matter particles within 1.5h“'^Mpc 
of the centre, is shown as the solid line. (All clusters with 
av,ios > 500kms“^ are included.) These data agree well 
with the cumulative mass distribution of clusters calculated 
by White et al. (1993) from N-body simulations of the same 
model. The different symbols show the distributions derived 
for the mock Abell cluster samples using different methods: 
the solid squares are the estimates obtained by applying the 
optimistic 3a clipping to all galaxies within ra', the open 
squares correspond to the case when this estimator is ap¬ 
plied only to galaxies within r^/S; the solid triangles give the 
result of using the den Hartog-Katgert method. The dashed 
line shows the distribution of dispersions for clusters identi- 
hed in 2D but using a reduced search radius of ra/3 and the 
scaled Abell richness criterion. The dispersions in this case 
were also derived using the 3a clipping procedure. (Note that 
the total number of clusters identified in this case is about 
a factor of 2 larger than the number of Abell clusters.) 

At values of the velocity dispersion below ~ 850 kms“^ 
all the distributions from the mock catalogues in Figure 9 
include only a steadily decreasing fraction of all the clus¬ 
ters present in the simulation. This turnover simply reflects 
the threshold richness required for selection which biases the 
sample against low-(j„ clusters. At large values of the veloc¬ 
ity dispersion, only the distribution for clusters identified 
with a small search radius approximates the true distribu¬ 
tion. All other catalogues overestimate the number of high 
dispersion clusters by large factors. Even the den Hartog- 
Katgert method which slightly underestimates the disper¬ 
sion of small clusters over-predicts the abundance of clus¬ 
ters with av,ios > 1500kms“^ by a factor of ~ 5. Note, 
however, that the tail of the distribution is exaggerated in 
a logarithmic plot like this. In fact, at the typical abun¬ 
dance of Abell R> 1 clusters, all methods perform quite well. 
For example, the velocity dispersion at an abundance of 
4 X 10“® h®Mpc“®, half the abundance of R>1 clusters, the 
dispersions obtained from the different methods lie within 
~ 150kms“^ of the true value. Thus, while the median ve¬ 
locity dispersion of the Abell cluster population is reason¬ 
ably well determined, the high dispersion tail of the distribu¬ 
tion is extremely uncertain. Figure 9 indicates that statisti¬ 
cal studies based on the median mass or velocity dispersion 
of the Abell cluster population (e.g. White et al. 1993) are 


robust whereas studies that concentrate on the tail of the 
distribution (e.g. Bahcall & Cen 1993) are unreliable. The 
APM and EDCC catalogues have a higher number density 
and are less affected by projection effects than R>1 Abell 
clusters. One would therefore expect a sample of such clus¬ 
ters to show a reduction in the spurious high-CT„ tail com¬ 
pared with a sample of Abell clusters 

Also plotted in Figure 9 are two determinations of 
n{> a^^ioa) from observational sam ples. T he solid circles 
show the estimate by Mazure et al. (1996) for a sample of 


80 clusters analyzed with the den Hartog-Katgert algorithm. 
The open circles show an estimate derived from Bahcall & 
Cen’s (1993) mass function of clusters, assuming that the 
transformation between mass and velocity dispersion is that 
given by their equation (3) for an isotropic distribution of 
galaxy velocities. The two observational estimates lie below 
the model predictions. This is consistent with the results 
of Frenk et al. (1990), White et al. (1993) and Eke et al. 
(1996b) which show that the abundance of clusters is cor¬ 
rectly reproduced in a CDM model only if 6 ~ 2. 


6 SUBSTRUCTURE 

Substructure in a cluster is symptomatic of a recent merger 
or accretion event. Typical survival times of accreted groups 
and subclusters are comparable to the crossing time (a few 
times 10® yr) and thus shorter than the age of the main 
cluster (Evrard 1990, Gonzalez-Casado, Mamon & Salvador- 
Sole 1994). The presence of substructure is therefore related 
to the recent history of a cluster and provides a useful way 
of constraining cosmological models in which cluster growth 
rates differ (Richstone et al. 1992; van Haarlem & van de 
Weygaert 1993; Bartelmann et al. 1993; Mohr et al. 1995; 
Wilson et al. 1996). In practice, however, identifying sub¬ 
structure in a cluster using the properties of its galaxy dis¬ 
tribution is difficult, and the results depend upon the test 
used to dehne and detect substructure. 

Many statistical tests have been proposed over the years 
to measure substructure. The first atte mpts employed only 
2D-data. For example, G eller fc Beers (1982) made contour 
plots based on Dressler’s (198C) measurements of galaxy po¬ 
sitions on the sky and concluded that ~ 40 per cent of clus¬ 
ters showed signs of subs tructu re. This claim was contra¬ 
dicted by W est fc Bothun (1990) and by Rhee, van Haarlem 
& Katgert (1991) who applied a range of statistical tests 
but found little evidence for substructure in the projected 
galaxy distribution in the inner regions of clusters. The in¬ 
clusion of radial velocity data allowed more powerful tests 
to be developed using all observable properties of th e clus - 
ter phase-space distribution. Dressier & Shectman (198f) 
applied the test that will be described below to a sample 
of 15 clusters with an average of 73 measured redshifts per 
cluster. They found evide nce for substructure in ~ 40 per 
cent of cases. Bird (1994) has claimed that as many as 85 


per cent of well-studied clusters show some evidence for sub¬ 
structure. To some extent these disparate results are due to 
different definitions of substructure. Nevertheless, the fact 
that substructure has been detected even in the Goma clus¬ 
ter (Fitchett & Webster 1988; Mellier et al. 1988; White, 
Briel & Henry 1993, Colless & Dunn 1995, Biviano et al. 
1996), usually regarded as the archetypal relaxed rich clus- 
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Figure 9. The cumulative distribution of cluster velocity dispersions based on all our synthetic cluster catalogues. The solid squares 
show velocity dispersions derived from all galaxies projected within Va from the cluster centre and the open squares from galaxies within 
ra/3 (both using 3 (t clipping to remove interlopers). The triangles were computed from our catalogues using the den Hartog & Katgert 
interloper removal algorithm. The same algorithm was applied by Mazure et al. (1996) to a sample of 80 AGO clusters and is shown 
here as the solid circles. The open circles were derived from Bahcall & Gen’s (1993) mass function. The dashed line shows n(> 20 ) 

for clusters selected in 2D using an Abell criterion scaled to raj's. The “true” distribution, calculated from the velocity dispersion of the 
dark matter within 1.5/r“^Mpc is shown by the solid line. 


ter, indicates that inhomogeneities are more prevalent than 
at first thought. 

We can use our mock cluster catalogues to assess some 
of the most widely used methods for characterising substruc¬ 
ture and to test ways in which these methods may be used 
to improve cluster mass determinations from optical data. 
It should be noted that our detailed quantitative results 
will depend on our assumed cosmological model, particu¬ 
larly on our adopted value of 17 = 1. This choice leads to 
more prevalent substructure than models with a low value 
of 17 (Richstone et al. 1992, Mohr et al. 1995, Wilson et al. 
1996). Specifically, we consider here the algorithm proposed 
by Dressier & Shectman (1988) which attempts to identify 
dynamically distinct subunits within the cluster. This test 
is sensitive to local deviations of the mean velocity and ve¬ 
locity dispersion relative to the global values determined for 
the cluster as a whole. For each galaxy in the sample, a lo¬ 
cal mean velocity viocai and a local velocity dispersion aiocai 
are computed using the radial velocities of the galaxy itself 
and of its 10 nearest projected neighbours. The combined 
deviations are given by the quantity 

Si = ^ [{viocal - vf + {(^local “ of] ■ (H) 

The test statistic. A, is the mean value of Si averaged over 
all galaxies in the sample. 

We computed A for our mock clusters using the same 
galaxies that were considered when calculating velocity dis¬ 


persions in Section 5 (i.e. after Sn-clipping or the applica¬ 
tion of the den Hartog & Katgert algorithm). The results are 
illustrated in Figure 10 as a series of plots for the 12 clus¬ 
ters used as examples throughout this paper. Each galaxy is 
represented by a circle with radius proportional to exp((5i). 
Individual galaxies with discordant radial velocities occur 
in all fields. However, in a number of cases these discor¬ 
dant galaxies form distinct clumps, indicating the presence 
of coherent substructure. We determine the significance of 
the subclustering signal using Monte-Carlo simulations. For 
each cluster we create 1000 Monte-Carlo realizations by ran¬ 
domly reshuffling radial velocities, keeping the galaxy po¬ 
sitions, and therefore the local neighbour lists and radial 
profiles, fixed. The probability that the observed value of A 
reflects real substructure is given by one minus the fraction 
of these Monte-Carlo simulations that have a value of A in 
excess of the observed value. The percentage of clusters with 
detected substructure at various confidence levels is given in 
Table 1. These percentages are a little low er th an the val- 
ues fo und by both Dressier & Shectman (198f) and Bird 
(1994). Since the number of galaxies in our mock clusters is 
similar to that in the observational samples, this small dis¬ 
crepancy could perhaps reflect a bias towards exceptionally 
rich clusters in the observational samples, since the degree of 
substructure is expected to depend on cluster mass (Lacey 
& Cole 1993). 


In Figure 11 we plot cluster velocity dispersions, es- 
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Figure 10. The Dressler-Schectman test applied to mock clusters. The symbols show the projected positions of galaxies in our mock 
catalogue. The radius of each circle is proportional to exp5i, where Si is a measure of the local deviation of the radial velocity and 
velocity dispersion from the average values for the cluster as a whole. Foreground and background galaxies have been removed prior to 
applying the test using a 3cr-clipping procedure. 


Table 1. Confidence levels of Dressier &; Shectman test 


Confidence level 

100 % 

99% 

95% 

90% 

Simulated Clusters 

12.7% 

20.1% 

31.6% 

40.1% 

Bird (1994) 

24.0% 

24.0% 

44.0% 

52.0% 


timated in two different ways, against the value of the 
Dressler-Schectman statistic for each cluster. In the left 
panel the (2D) velocity dispersions were obtained using all 
galaxies within 1.5/i~^Mpc and after applying our standard 
3a clipping procedure (n.b. we use the same sample of galax¬ 
ies to compute both A and a^^ios)', in the right hand panel 
the (2D) dispersions were obtained using den Hartog & Kat- 
gert’s (1996) interloper removal algorithm. There is a strong 
correlation between a„^ 2 D and A in the left hand panel, con¬ 
firming our earlier conclusion from Figure 9 that much of 
the high (j„ tail in the n(> a^^ios) distribution is due to 
artificially large values produced by substructure. The cor¬ 
relation largely disappears if the den Hartog-Katgert algo¬ 
rithm is used, although as we noted above this algorithm 
tends to slightly underestimate all velocity dispersions. Our 
analysis confirms Bird’s (1995) view that cluster masses can 


be severely overestimated if substructure is not carefully re¬ 
moved. 


7 THE CLUSTER-CLUSTER CORRELATION 
FUNCTION 

The amplitude of the cluster-cluster correlation function, 
^cc, has been a subject of debate for several years, following 
the original claim by Bahcall & Soneira (1983) of a large 
clustering length, ro ~ 25h~^Mpc, for R>1 Abell clusters 
(where ro is defined by ^cc(ro) = 1.) Subsequent redetermi¬ 
nations for Abe ll samples have given similar, if somewhat 
smaller, values (Postman, Geller & Huchra 1992), but esti¬ 


mates from automated cluster catalogues give much smaller 


results, (rp ~ 11 —14h ^Mpc) (Efstathiou et al. 1992 


et al. 1992; Dalton et al. 1994). The latter are closer to 
(although slightly higher than) the predictions o f the stan¬ 


Micho] 


1996a). 


dard C DM model (White et al. 1987; Croft 1994; Eke et al 


One of the explanations that has been put forward for 
the unexpectedly large value of ro in the Abell sample is 
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Figure 11. The correlation between velocity dispersion estimates and the strength of the substructure signal as measured by Dressier Sz 
Schectman’s A parameter. In the left panel, foreground and background galaxies were removed using an iterative 3(7 clipping procedure. 
On the right, the den Hartog-Katgert algorithm (see text for details) was applied to galaxies with a projected separation of less than 
lh“^Mpc from the cluster centre. 
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Figure 12. The two-point correlation function of clusters in red- 
shift space. The open triangles show the estimate for R> 1 Abell 
clusters by Bahcall & Soneira (1983). The open circles show the 
correlation function for clusters found in 3D with a mean inter¬ 
cluster separation of 50/i“^Mpc. The solid and open squares show 
results for clusters identified in 2D, with the poorer of a pair sep¬ 
arated by less than 3h“^Mpc (solid squares) or 1.5h“^Mpc (open 
squares) omitted from the sample. 


Dekel et al. 198£ 


Efs- 


tathiou et al. 1992). The apparent richness of sub-threshold 
groups projected close to the line-of-sight to a rich clus¬ 
ter may be overestimated, and the groups promoted to the 
R>1 class. This would introduce spurious close pairs into the 
sample leading to artificially high estimates of ^cc and to dis¬ 
tortions in the correlation function measured as a function 
of the separation along the line-of-sight. Sutherland (1988) 
and Sutherland & Efstathiou (1992) argued that these dis¬ 
tortions are present in the Abell cluster sample and, by ap¬ 
plying an empirical correction for this effect, derived a low 
value of ro « 13/i“^Mpc for the Bahcall & Soneira sample 
of Abell clusters. 

We can assess the importance of projection effects in 
the determination of ^cc for Abell clusters by comparing es¬ 
timates for our simulated samples of clusters identified in 
2D and 3D respectively. Since the cluster correlation func¬ 
tion depends on cluster r ichness, or equivalent l y on the mean 
inter-cluster separat ion ([Bahcallfc^CerDlT93 ; Mann, Heav¬ 


ens & Peacock 199f; Croft 1994; Eke et al. 1996aD , we com- 


pare mock Abell cluster samples with a sample of the rich¬ 
est clusters identified in the 3D mass distribution, with a 
similar mean inter-cluster separation, D — 50h~^Mpc. A 
complication arises from an ambiguity in the way in which 
overlapping clusters should be treated. Unfortunately, Abell 
(1958) did not specify what action he took when two clus¬ 
ters appeared to overlap on the sky. We have considered two 
cases, one in which the poorer of two clusters that have a 
separation of less than 3/i“'^Mpc is excluded (i.e. clusters 
cannot have overlapping Abell radii) and one in which this 
restriction is relaxed to 1.5h“^Mpc (i.e. a cluster centre can¬ 
not lie within the Abell radius of another cluster.) Since this 
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second criterion is less restrictive, 12 per cent more clusters 
make it into this sample. 

In Figure 12 we compare the redshift-space correlation 
functions of clusters identified in 2D (open and solid squares) 
and 3D (open circles). The correlation strength in the 2D 
sample in which overlapping Abell radii are allowed exceeds 
that of the 3D sample on all scales, but the difference is only 
large at small pair separations. As a result, the clustering 
length of this 2D sample is only about 10 per cent larger 
than the value, ro — 12.2h~^Mpc, for the sample identified 
in 3D. This is still significantly smaller than the clustering 
length estimated for Abell clusters by Bahcall & Soneira 
(1983) (see Figure 12). Thus, we conclude that projection 
effects are not sufficient to enhance the correlation function 
of clusters in the standard CDM cosmology to the amplitude 
estimated by Bahcall & Soneira. We note, however, that 
our mock Abell cluster sample does not exhibit the strong 
distortions in the correlation function along the line-of-sight 
present in Bahcall & Soneira’s sample. The enhancement in 
the correlation function of the 2D sample is reminiscent of 
the effect suggested by Sutherland (1988) but the strength 
of this effect appears to be weak. 


8 DISCUSSION AND CONCLUSIONS 

We have investigated two sources of uncertainty in the con¬ 
struction and analysis of cluster catalogues selected from 2D 
galaxy maps. The first is the influence of projection effects 
on the identification and richness classification of clusters. 
The second is the influence of local subclustering on esti¬ 
mates of cluster velocity dispersions. We have found that 
both these effects are large and compromise the use of clus¬ 
ter properties as cosmological diagnostics. 

Our conclusions are based on the analysis of mock clus¬ 
ter catalogues. Although the clusters themselves are iden¬ 
tified in 2D galaxy maps using criteria patterned on those 
employed in real catalogues, in the simulations we have ac¬ 
cess to full 3D spatial and velocity information for the model 
galaxies. These were identified using the high peak model of 
biased galaxy formation (the peak-background split tech¬ 
nique) in an fl = 1 standard CDM universe with primor¬ 
dial power spectrum normalized so as to obtain the correct 
abundance of rich Abell clusters. The resulting galaxy au¬ 
tocorrelation function is similar to the observed one in the 
relevant range of separations 2 < r//i“^Mpc < 10. Clus¬ 
ters were selected from volume-limited galaxy samples that 
are complete above the luminosity required to select Abell 
clusters over the entire volume. Because the box length in 
our simulations is smaller than the effective path length to 
a moderately distant Abell cluster, our procedure underes¬ 
timates the degree of contamination by distant background 
galaxies. Our conclusions regarding the completeness and 
contamination of Abell’s cluster catalogue do not depend 
sensitively on the specific cosmological assumptions which 
we have chosen for our simulations. On the other hand, our 
conclusions regarding the effects of subclustering on esti¬ 
mates of cluster velocity dispersions, could well be sensitive 
to our adopted model, particularly to the assumption that 
17 = 1, since the degree of substructure in clusters depends 
on the value of Q (Richstone et al. 1992; Mohr et al. 1995; 


Wilson et al. 1996). We intend to investigate this issue in a 
subsequent paper. 

Most of the clusters identified as rich clusters in our 
mock catalogues are associated with at least one large galaxy 
concentration along the line-of-sight. However, in almost ex¬ 
actly one third of cases, clusters classified as having R> 1 
do not correspond to a galaxy concentration which satisfies 
the required criterion in three-dimensions. Instead, the ap¬ 
parent richness of these clusters is boosted above threshold 
by the alignment of several smaller clumps along the line- 
of-sight. For about half of these spurious R> 1 clusters, the 
main concentration along the line-of-sight fails even to meet 
the criterion of thirty galaxies required for an R= 0 cluster. 
In a small number of cases (5 per cent) the second largest 
clump along the line-of-sight satisfies the R> 1 richness cri¬ 
terion in its own right. These clusters should have been in¬ 
cluded as separate entries in a complete catalogue. The main 
source of incompleteness, however, is clusters that are suf¬ 
ficiently rich, but which are missed because of a downward 
fluctuation in the number of background galaxies. Note that 
whereas our determination of the background is based on the 
mean number of galaxies expected along each line-of-sight, 
Abell’s determination was based on a local estimate which 
is not fully explained in his original paper. Our results show 
that Abell’s catalogue is neither homogeneous nor complete 
above a uniform luminosity limit. 

Although our simulations are the first to quantify the 
importance of projection effects on Abell’s catalogue, the 
existence of such effects has been suspected for a long time. 
For this reason, it has often been thought that more homo¬ 
geneous and complete catalogues could be constructed by 
selecting clusters according to the X-ray luminosity of their 
intra-cluster medium. Because the X-ray emission is cen¬ 
trally concentrated, projection effects are likely to be much 
less severe in this case. Our simulations give partial sup¬ 
port to this view. Although they do not follow the evolution 
of the X-ray emitting gas, ‘X-ray luminosities’ can be as¬ 
signed to clusters in a plausible way that is consistent with 
the results of N-body/hydrodynamic simulations. We find 
that although many of the spurious optical identifications 
are avoided by applying an X-ray luminosity criterion, pro¬ 
jection effects are not completely eliminated. For example, 
in 16 per cent of cases, the second most luminous cluster 
projected along the line-of-sight to another cluster is bright 
enough to have been detected in the survey of Briel & Henry 
(1993) even though, in practice, it is unclear whether these 
secondary peaks would have been identified as separate clus¬ 
ters. 

The abundance of galaxy clusters is a fundamental cos¬ 
mological diagnostic which has been used, in various forms, 
to determine the amplitude of mass fluctuations (Frenk et al. 
1990, Evrard 1989, White et al. 1993, Viana & Liddle 1996, 
Eke et al. 1996b), to rule out specific cosmological mod¬ 
els (Bahcall & Cen 1993; Lubin et al. 1996), or to provide 
an estimator of 17 (Eke et al. 1996b). Our analysis of pro¬ 
jection effects shows that if the abundance is characterized 
by the distribution of velocity dispersion (or by the mass 
distribution inferred from it), then results based on Abell’s 
cluster catalogue are extremely unreliable. Abell’s cluster 
catalogue is so strongly contaminated that all estimators of 
velocity dispersion, even elaborate ones like that of den Har- 
tog & Katgert (1996), produce spuriously large values. For 
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example, a standard So- clipping procedure applied only to 
galaxies close to the cluster centre overestimates the true 
abundance of clusters with (T„,ios = 1500 kms~^ by more than 
an order of magnitude. Tests that rely on the high mass tail 
of the distribution derived from optical data are therefore 
particularly unreliable. Since subclustering always tends to 
inflate the estimated velocity dispersion, low estimates of 
o'v,ioa are generally more reliable than large ones. For ex¬ 
ample, the estimator mentioned above leads to a reasonable 
correlation between estimated and true values for (Jv,ios^ 
850 kms“^, but to no correlation at all beyond that. As a 
result, the median velocity dispersion of the cluster distri¬ 
bution is reasonably well determined from the data (White 
et al. 1993). 

Our simulations indicate that increasing the number of 
galaxies used to estimate dispersions will not, in general, 
produce much improvement. (Dispersions for the most mas¬ 
sive of our simulated clusters were typically measured using 
over 100 galaxies.) This is because the source of the problem 
is the presence of genuine substructure which is not elimi¬ 
nated by including more galaxies. A better way to improve 
upon estimates based on optical data is to use new cluster 
catalogues such as the APM catalogue which, by virtue of 
assuming a smaller search radius than Abell’s, already elimi¬ 
nate much of the contamination. Even in this case, however, 
tests such as those presented here are necessary to assess the 
extent of any remaining biases present in the data. 
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